Co-Training Semi-Supervised Learning for Fine-Grained Air Quality Analysis
نویسندگان
چکیده
Due to the limited number of air quality monitoring stations, data collected are limited. Using supervised learning for fine-grained analysis, that is used predict index (AQI) locations without may lead overfitting in models have superior performance on training set but perform poorly validation and testing set. In order avoid this problem learning, most effective solution increase amount data, study, not realistic. Fortunately, semi-supervised can obtain knowledge from unlabeled samples, thus solving caused by insufficient samples. Therefore, a co-training method combining K-nearest neighbors (KNN) algorithm deep neural network (DNN) proposed, named KNN-DNN, which makes full use samples improve model analysis. Temperature, humidity, concentrations pollutants source type as input variables, KNN DNN learners. For each learner, labeled initial relationship between variables AQI. iterative process, labeling pseudo-sample with highest confidence selected expand The proposed evaluated real dataset stations 1 February 30 April 2018 over region 118° E–118°53′ E 39°45′ N–39°89′ N. Practical application shows has significant effect analysis quality. coefficient determination predicted value true 0.97, better than other models.
منابع مشابه
Deep Air Learning: Interpolation, Prediction, and Feature Analysis of Fine-grained Air Quality
The interpolation, prediction, and feature analysis of fine-gained air quality are three important topics in the area of urban air computing. The solutions to these topics can provide extremely useful information to support air pollution control, and consequently generate great societal and technical impacts. Most of the existing work solves the three problems separately by different models. In...
متن کاملSemi-Supervised Learning of Alternative Splicing Events Using Co-Training
Alternative splicing is a phenomenon that gives rise to multiple mRNA transcripts from a single gene. It is believed that a large number of genes undergoes alternative splicing. Predicting alternative splicing events is a problem of great interest to biologists, as it can help them to understand transcript diversity. Supervised machine learning approaches can be used to predict alternative spli...
متن کاملSemi-supervised Co-training Algorithm Based on Assisted Learning
The classification performance of the learner is weakened when unlabeled examples are mislabeled during co-training process. A semisupervised co-training algorithm based on assisted learning (AR-Tri-training) was proposed. Firstly, the assisted learning strategy was presented, which is combined with rich information strategy for designing the assisted learner. Secondly, the evaluation factor wa...
متن کاملSemi-Supervised Regression with Co-Training
In many practical machine learning and data mining applications, unlabeled training examples are readily available but labeled ones are fairly expensive to obtain. Therefore, semi-supervised learning algorithms such as co-training have attracted much attention. Previous research mainly focuses on semi-supervised classification. In this paper, a co-training style semi-supervised regression algor...
متن کاملSupervised Learning and Co-training
Co-training under the Conditional Independence Assumption is among the models which demonstrate how radically the need for labeled data can be reduced if a huge amount of unlabeled data is available. In this paper, we explore how much credit for this saving must be assigned solely to the extra-assumptions underlying the Co-training model. To this end, we compute general (almost tight) upper and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Atmosphere
سال: 2023
ISSN: ['2073-4433']
DOI: https://doi.org/10.3390/atmos14010143